Fast Incremental SVDD Learning Algorithm with the Gaussian Kernel

Authors

  • Hansi Jiang
  • Haoyu Wang
  • Wenhao Hu
  • Deovrat Kakde
  • Arin Chaudhuri
Abstract

Support vector data description (SVDD) is a machine learning technique used for single-class classification and outlier detection. The idea of SVDD is to find a set of support vectors that defines a boundary around the data. When dealing with online or large data, existing batch SVDD methods have to be rerun in each iteration. We propose an incremental learning algorithm for SVDD that uses the Gaussian kernel. The algorithm builds on the observation that all support vectors on the boundary have the same distance to the center of the sphere in the higher-dimensional feature space induced by the Gaussian kernel function. Each iteration involves only the existing support vectors and the new data point. Moreover, the algorithm is based solely on matrix manipulations; the support vectors and their corresponding Lagrange multipliers α_i are automatically selected and determined in each iteration. The complexity of our algorithm in each iteration is only O(k^2), where k is the number of support vectors. Experimental results on some real data sets indicate that the proposed algorithm, FISVDD, achieves significant gains in efficiency with almost no loss in either outlier detection accuracy or objective function value.
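The equal-distance observation in the abstract can be illustrated with a small numerical sketch. This is not the authors' algorithm: it only shows, under stated assumptions, that with a Gaussian kernel (where K(x, x) = 1) one can pick multipliers that sum to 1 and place every support vector at the same feature-space distance from the center. The function names, the bandwidth `sigma`, and the sample points are all hypothetical, and the sketch does not check that the multipliers are positive or handle the selection of support vectors.

```python
import numpy as np


def gaussian_kernel(X, Y, sigma=1.0):
    """Gaussian kernel matrix K[i, j] = exp(-||X[i] - Y[j]||^2 / (2 * sigma^2))."""
    sq = np.sum(X**2, axis=1)[:, None] + np.sum(Y**2, axis=1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * sigma**2))


def equal_distance_alphas(K):
    """Solve K @ alpha = c * 1 and rescale so that sum(alpha) == 1.

    Assumption: K is the kernel matrix of distinct points, so it is invertible.
    """
    raw = np.linalg.solve(K, np.ones(K.shape[0]))
    return raw / raw.sum()


def sq_dist_to_center(Z, SV, alpha, sigma=1.0):
    """Squared feature-space distance from each row of Z to the center sum_i alpha_i * phi(SV[i]).

    Uses K(z, z) = 1 for the Gaussian kernel:
    d^2(z) = 1 - 2 * sum_i alpha_i K(SV[i], z) + alpha^T K alpha.
    """
    K_sv = gaussian_kernel(SV, SV, sigma)
    K_z = gaussian_kernel(Z, SV, sigma)
    return 1.0 - 2.0 * K_z @ alpha + alpha @ K_sv @ alpha


# Hypothetical candidate support vectors in 2-D.
support_vectors = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.5, 1.5]])
alpha = equal_distance_alphas(gaussian_kernel(support_vectors, support_vectors))
d2 = sq_dist_to_center(support_vectors, support_vectors, alpha)
print(alpha, d2)
```

Because the kernel matrix here has size k by k, where k is the number of support vectors, each step of this sketch touches only the support vectors, which is the property the abstract relies on for its per-iteration cost.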


Similar resources

Incremental Learning Algorithm for Support Vector Data Description

Support vector data description (SVDD) has become a very attractive kernel method due to its good results in many novelty detection problems. Training SVDD involves solving a constrained convex quadratic programming problem, which requires large memory and enormous amounts of training time for large-scale data sets. In this paper, we analyze the possible changes of support vector set after new samples are a...


The Mean and Median Criterion for Automatic Kernel Bandwidth Selection for Support Vector Data Description

Support vector data description (SVDD) is a popular technique for detecting anomalies. The SVDD classifier partitions the whole space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, and the Gaussian kernel is a common choice ...


An Algorithm of Unsupervised Posture Clustering and Modeling Based on GMM and EM Estimation

Support vector data description (SVDD) has become a very attractive kernel method due to its good results in many novelty detection problems. Training SVDD involves solving a constrained convex quadratic programming problem, which requires large memory and enormous amounts of training time for large-scale data sets. In this paper, we analyze the possible changes of support vector set after new samples are a...


Ellipse Support Vector Data Description

This paper presents a novel boundary-based approach to one-class classification that is inspired by support vector data description (SVDD). SVDD is a popular kernel method that tries to fit a hypersphere around the target objects; a more precise boundary depends on selecting proper parameters for the kernel function. Even with a flexible Gaussian kernel function, the SVDD cou...


Unsupervised Nonlinear Feature Extraction Method and Its Effects on Target Detection in High-dimensional Data

Principal component analysis (PCA) is one of the most effective unsupervised techniques for feature extraction. To extract higher-order properties of data, researchers extended PCA to kernel PCA (KPCA) by means of kernel machines. In this paper, KPCA is applied as a feature extraction and dimension reduction step that preprocesses hyperspectral images for target detection. Then the ...



Journal:
  • CoRR

Volume abs/1709.00139

Publication date: 2017